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Abstract Three algorithms looking for pretty large partial Hadamard ma- 
trices are described. Here "large" means that hopefully about a third of a 
Hadamard matrix (which is the best asymptotic result known so far, [5]) is 
achieved. The first one performs some kind of local exhaustive search, and 
consequently is expensive from the time consuming point of view. The second 
one comes from the adaptation of the best genetic algorithm known so far 
searching for cliques in a graph, due to Singh and Gupta [2 b. The last one 
consists in another heuristic search, which prioritizes the required processing 
time better than the final size of the partial Hadamard matrix to be obtained. 
In all cases, the key idea is characterizing the adjacency properties of vertices 
in a particular subgraph Gt of Ito's Hadamard Graph A(At) [THj, since cliques 
of order m in Gt can be seen as (m + 3) x At partial Hadamard matrices. 
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1 Introduction 

Hadamard matrices consist in {1, — l}-square matrices whose rows are pairwise 
orthogonal. This nice property makes Hadamard matrices being objects for 
multiple applications (see [13] and [14] for instance). 

It may be straightforwardly checked that such a matrix must be of size 1, 2 
or a multiple of 4. The Hadamard Conjecture claims that a matrix of this type 
exists for every size multiple of 4. Many attempts have been devoted to prove 
this conjecture (both from a constructive way |14| and also from a theoretical 
point of view in terms of asymptotic results of existence [51111] ) , but it remains 
unsolved so far. 

From the practical point of view, taking into account possible applica- 
tions, sometimes there is no need to consider a full Hadamard matrix. In fact, 
it suffices to meet a large amount of pairwise orthogonal rows [2J. This has 
originated the interest in constructing partial Hadamard matrices PH, that 
is, m X n (1, — l)-matrices PH satisfying PH ■ PH T = nl m , for m < n. We 
call m the depth of PH. 

From the orthogonality law, it is readily checked that the number n of 
columns must be 1 , 2 or a multiple of 4. 

Notice that a partial Hadamard matrix does not need to be a submatrix of a 
proper Hadamard matrix (for instance, cliques listed in Table [3] for 7 < t < 10 
are maximal but not maximum; in the sense that although no larger cliques 
exist containing them, there exist larger cliques, for example, those related to 
full Hadamard matrices). 

Although partial Hadamard matrices are as useful as Hadamard matrices 
themselves with regards to practical purposes, unfortunately it seems that 
their explicit construction is equally hard as well. 

De Launey proved in [5] that partial Hadamard matrices of size about 
a third of a At x At Hadamard matrix exist for large t. The proof gives a 
polynomial time algorithm in t for constructing such a matrix. Furthermore, 
De Launey and Gordon proved in [10] that about a half of a Hadamard matrix 
At x At exists for large t, assuming that the Riemann hypothesis is true. The 
idea is decomposing 2t — i as the sum of i odd prime numbers p^ 2 < i < 3, so 
that the juxtaposition of the corresponding Paley conference matrices provides 
a partial Hadamard matrix of depth 2 min{pi}+2. Unfortunately, none of these 
methods can provide a partial Hadamard matrix of depth greater than half of 
a full Hadamard matrix. 

In this paper, we present three new algorithms for constructing partial 
Hadamard matrices of size m x At. The first one performs some kind of local 
exhaustive search, and consequently is expensive from the time consuming 
point of view. The second one comes from the adaptation of the best genetic 
algorithm known so far searching for cliques in a graph, due to Singh and 
Gupta |21) . The third one consists in another heuristic search, which prioritizes 
the required processing time better than the final size of the partial Hadamard 
matrix obtained so far. The idea is looking for large cliques (i.e. subgraphs 
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whose vertices are pairwise adjacent) in a subgraph Gt of Ito's Hadamard 
Graph A(4t) [H]. 

Although the results showed in [8] and |10j are impressive and meaningfully 
better than any obtained from the algorithms described in this paper, it is a 
remarkable fact that our algorithms may provide partial Hadamard matrices of 
depth greater than half of a full Hadamard matrix. Furthermore, it is possible 
(and desirable) to run our algorithms taking as input data partial Hadamard 
matrices obtained by the procedures in [811X0] , so that deeper partial Hadamard 
matrices are constructed (see Table I4.2[) . 

The paper is organized as follows. 

The graph Gt and its properties are described in Section 2. Section 3 is 
devoted to the description of the local exhaustive algorithm looking for cliques 
in Gt- In Section 4, the two heuristics searching for cliques in Gt are described. 
Last section is devoted to conclusions. 

2 The graph G t 

In what follows, for clarity in the exposition, we will simply use + and — 
instead of 1 and — 1. 

Hadamard Graphs were introduced by Ito in [T5] . Originally they referred 
to the graph A(4t) whose vertices are the (1, —1)- vectors of length At consisting 
of an even number of Is. The adjacency relation consists in orthogonality. 

We call Hadamard graph to the subgraph G t of A(4i) induced by the 
(1, — l)-vectors simultaneously orthogonal to the three first rows of a normal- 
ized Hadamard matrix, 



/+...+ 

+ . . . + 


+ ... + 
+ ... + 


+ ... + 


+ ...+X 


+ . . . + 
V ••• 




+ ... + 


... 1 



These orthogonality conditions straightforwardly characterize the form of 
the vertices in Gt- 

Lemma 1 The vertices of Gt consist of (I, — \)-vectors of length At where the 
2t negative entries are distributed following this pattern, 

| k | t-k | t-k | k | 

t t t t 

so that exactly k, t — k, t — k and k negative entries occur among every t 
positions, for some < k < t. 

We may then classify the set of vertices in Gt attending to the number 
k of negative entries which appear in positions 1 through t. In what follows, 
a /c-vertex in Gt refers to a vertex with precisely k negative entries among 
positions 1 to t. 
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Lemma 2 In particular, the number of vertices in Gt is \Gt \ = 

fc=0 

The tables below give the number of fc-vertices (n.v.) and their degree S 
(number of adjacent vertices), for 1 < t < 7 and < k < t. The column total 
refers to the number of vertices and edges in Gt (notice that the number of 
edges is half the summation of the degree of every vertex) . It is evident that 
for a fixed value of fc, every k- vertex has the same degree (since permuting 
some columns does not affect to the orthogonality relation). 




Table 1 Vertices and edges in Gt- 
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21252 



73440000 
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263844 
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160000 
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n.v. 
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2401 


194481 


1500625 


1500625 


194481 


2401 
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3395016 
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960000 


960000 


960000 


960000 


960000 


960000 





1629606720000 



It is readily checked that the size of Gt grows exponentially on t. Since 
cliques of size m in Gt translate to partial Hadamard matrices (m + 3) x At, we 
would like to search for large cliques in Gt- Since the largest clique in A(At) is 
at most of size At (see [17] for details), the largest clique in Gt is at most of size 
At — 3. Cliques meeting the upper bound would correspond to full Hadamard 
matrices. 

Example 1 For instance, consider the graph G*2, obtained from t — 2. The 
picture below shows a clique of size 5, so that adding the three normalized 
rows we obtain a full Hadamard matrix of size 8x8. 
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A maximum clique is a clique with the maximum cardinality (which is 
called the maximum clique number). This notion is different from that of 
maximal clique, which refers to a clique which is not a proper subset of any 
other clique. Thus maximal cliques need not be maximum ones (as we had 
already noticed in the introduction), though the converse is always true. 

Given a graph, the maximum clique problem (MCP) is to find a maximum 
clique, and it is NP-complete [BJ. Unfortunately, there is no polynomial-time 
algorithm for approximating the maximum clique within a factor of n l ~ t unless 
P=NP [T2], where n is the number of the vertices of the graph. Moreover, there 
is no polynomial-time algorithm approximating the clique number within a 

factor of - — %: — unless NP=ZPP \m. 
(log n) L ~ e 

Anyway, our purpose here is to design an algorithm for constructing suf- 
ficiently large cliques in G t , at least of depth greater than a third of a full 
Hadamard matrix. To this end, we need to study the properties of Gt in a 
more detailed way. 

In what follows, for brevity, we will adopt the additive notation for rep- 
resenting Hadamard matrices, so that the Is turn to Os and the —Is turn to 
Is. 

This way, fc-vectors in Gt are now described as (0, l)-vectors of length At 
consisting of precisely 2t ones (and hence 2t zeros), which are distributed in 
the following way: there are exactly k ones in positions 1 through t, other t — k 
ones in positions t+l through 2t, another t — k ones in positions 2t + 1 through 
3t, the last k ones being located in positions 3t + 1 through At. 

Each of these A:-vectors may be straightforwardly codified as an integer, 
assuming that the fc-vector is the binary representation of a decimal number. 
Therefore cliques are codified as lists of integers, each of them being the deci- 
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mal representation of a binary number consisting of 2t ones and length less or 
equal to 4t. 

Lemma 3 Actually, it can be assumed that < k < [— J . 

Proof This is a straightforward consequence of the fact that the negation of a 
row does not affect to the set of its orthogonal vectors. 

■ 

That being so, the full set of vertices adjacent to a given fc-vertex v may 
be obtained by calculating those s- vectors w orthogonal to v, for < s < |_f J , 
and then adding their complements. Notice that with the additive notation at 
hand, a fc-vector v and a s-vector w are orthogonal if and only if they share 
exactly 2t bits. 

We now describe a procedure for determining a set <5 V of generators for the 
adjacency list related to a fixed fc-vector v, that is, generating those s- vectors 
w orthogonal to v. 

In order to compute the total amount of coincidences between v and w, 
one may split the vectors by quarters, and count the amount i of 1-bits (for 
the first and last quarters) and 0-bits (for the second and third quarters) that 
these vectors share in each of these basic quarters. 

Lemma 4 At the first and fourth (resp., the second and third) quarters the 
number i of coincidences in Is (resp, in Os) runs in the range [max(0, s + k — 
t), min(fc, s)] . Actually, assuming the conditions in Lemma\3[ i £ [0, min(fc, s)]. 

Lemma 5 At each quarter, the number cti of total coincidences (both in Is 
and 0s) satisfies cti = t— s— k + 2i, and runs in the range [\t — k — s\,t— \ s — k\]. 
Actually, assuming the conditions in Lemma\^ ai € [t — s — k, t — \s — k\] . 

Proof Assume that at a given quarter the vectors v and w share exactly i 
1-bits. Then w has k — i 0-bits in those positions where the remaining 1-bits 
of v are located. Analogously, v has s — i 0-bits in those positions where the 
remaining 1-bits of w are located. Thus v and w share exactly t ~ i — (k — 
i) — (s — i) = t — k — s + i 0-bits. Since v and w share i 1-bits and t — k — s + i 
0-bits, the total number of coincidences is 014 = t — k — s + 2i. 

m 

Corollary 1 a J+ i = ai + 2. 

Let denote n = min(/c,s). In the conditions above, the set of total coin- 
cidences is given by t — s — k = ao < ... < a n = t — \s — k\. We may now 
describe the set of s-vectors adjacent to a given fc-vector. 

Proposition 1 The set of vectors orthogonal to a given k-vector corresponds 
to the full set of distributions of vectors satisfying tuples of total coincidences 
(a i± , a i2 , a i3 , a u ) such that = 2t. 
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Proposition 2 The set of tuples {on 1 , , ot.i 3 , ) which give rise to orthog- 
onal s-vectors are characterized as the solutions of the following system of 
diophantine equations 

x a + . . . + x n a n = 2t 
x Q + ...+ x n = 4 (1) 

i,£Z: < x t < 4 

Here, n = min(fc, s) and Xi indicates how many coincidences of the type on 
must occur among the four quarters. 

We now give a constructive way to solve the system above. 

Proposition 3 There exists a solution for the system flj) iff Aolq < 2t < 4a n . 

Proof From Corollary [I] we know that the difference between two consecutive 
on, is 2. 

Since 4ao, 2t and 4a n are even and taking into account Corollary [TJ we 
conclude that every even number in the range [4«0j 4a ra ] may be (not uniquely, 
in general) written as a combination oti t + c^ 2 + aj 3 + for some values 
ao < a ix < a l2 < a i3 < a i4 < a n . 

m 

The condition above may be straightforwardly generalized for the case of 
solutions related to tuples of the type (a^, on 2 , aj 3) a^), on 1 < a.i 2 < a{ 3 < 
a u . 

Corollary 2 Fixed t and < k, s < |_|J , the set sol of solutions to the system 

(QP may be constructed in the following way: 

sol ^r- 

ao t — k — s 

a„ <— t — \k — s\ 

for i\ from max{ai, It — 3a„} to min{a ra , } with step 2 do 

for ^2 from max{ii, 2t — i\ — 2a n } to min{a ra , L 2 *^ 11 J } with step 2 
do 

for «3 from max{i2, 2t — i\ — i2 — a n } to min{a n , [ 2 *~' 2 1 ~ t2 j| with 
step 2 do 

sol <— sol U {{ii,i 2 , «3, 2t — zi — « 2 — 23}} 

od 

od 

od 

Given a tuple (a 2l , cn 2 , oti 3 , a^ 4 ) solution to ([lj, construct the four matrices 
Nk whose rows are those vectors satisfying ai k total coincidences with the 
corresponding quarter of v. By construction, the juxtaposition of any of the 
rows of these matrices gives a vector orthogonal to v. 

Proposition 4 A set S v of generators for the adjacency list of v may be 
straightforwardly constructed in terms of matrices of the type above. 
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In spite of the fact that the size (both in edges and vertices) of Gt grows 
exponentially in t, the procedure described in Proposition 2] is a cheaper way 
(in terms of both time and space) of saving this information. 

We will illustrate now how the proposition above works by means of an 
example. In order to simplify the reading, in what follows we will use a vertical 
line | to separate the different quarters of a &;-vector. 

Example 2 Let us consider the case t = 5, k = 2 and v = 684646 (its binary 
representation gives the 2-vector v = (10100|11100|10011|00110)). We are go- 
ing to calculate the full set of s-vectors orthogonal to v, for < s < 2 = [f J 
(recall that the remaining orthogonal vectors are obtained by simply inter- 
changing the 0-bits and 1-bits, since they are the negation of the vectors just 
calculated) . 



1. Case s — 0. 

From Lemma 21 we know that there is just one value for the number i of 
coincidences in 1-bits, namely i = 0. Consequently, there is just one value 
ai, namely cto = 3. 

Since 4ao = 12 7^ 10 = 2 • 5, the system (TTJ) has no solutions, so there is no 

0- vector orthogonal to v. 

This was evident from the very beginning, since there is only one 0- vector, 
w = 32736 (i.e. w = (00000|11111|11111|00000)), and it is not orthogonal 
to v. 

2. Case s = 1. 

From Lemma HI we know that i € {0, 1}, and hence 04 £ {2,4}. The set 
of different ways in which four may be order less chosen to sum 10 is 
described by the system ([T]), 

f 2x + 4xi = 10 
\ x + xx = 4 

Since 4«o = 8 < 10 < 4ax = 16, there exist solutions for the system. In 
fact, there is just one solution, (xq,Xx) — (3, 1), which corresponds to the 
following distribution of total coincidences (up to reordering): (2,2,2,4). 
In order to explicitly construct those 1-vectors w meeting the distribution 
(2,2,2,4), we have to find those 1-vectors with coincidences with v in 

1- bits in the first quarter, coincidences in 0-bits in the second and third 
quarters, and 1 coincidence in 1-bits in the fourth quarter. 

/5 - 2\ 

Since v is a 2-vector, in the first quarter there are ( ^ ) cri °i ces f° r Pac- 
ing the 1-bit of w among the 0-bits of v. Analogously, there are 
choices for placing the 0-bit of w among the 1-bits of v in the second and 
third quarters. And there are ( ^ ~\ choices for placing the 1-bit of w among 
the 1-bits of v. 
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In conclusion, the set of 1-vectors meeting the distribution of total coin- 
cidences (2, 2, 2, 4) is generated by the juxtaposition of any of the rows of 
the following matrices 



10 
1 
1 



1111 
10 111 
110 11 



1111 
1110 1 
11110 



10 
1 



Similar schemes are achieved with the remaining orderings of the valid 
distribution of total coincidences, (2,2,4,2), (2,4,2,2) and (4,2,2,2). In 
fact, they may be obtained by simply permuting and/or negating some 
suitable columns of the distribution above. 
3. Case s = 2. 

From Lemma 21 we know that i £ {0, 1, 2}, and hence a, £ {1, 3, 5}. The 
set of different ways in which four a,; may be ordcrless chosen to sum 10 is 
described by the system ([1]), 



J xq + 3xi + 5x2 = 10 
\ x + x\+ x 2 = 4 



Since 4ao = 4 < 10 < Aa 2 — 20, there exist solutions for the system. In 
fact, there are just two solutions, (xo, 2:1,22) £ {(2, 1, 1), (1, 3, 0)}, which 
correspond to the following distribution of total coincidences (up to re- 
ordering): (1,1,3,5) and (1,3,3,3). 

In order to explicitly construct those 2- vectors w meeting the distribution 
(1,1,3,5), we have to find those 2- vectors with coincidences with v in 
1-bits in the first quarter, coincidences in 0-bits in the second quarter, 1 
coincidence in 0-bits in the third quarter, and 2 coincidences in 1-bits in 
the fourth quarter. 

/5 - 2\ 

Since v is a 2- vector, in the first quarter there are ( 2 ) cn °i ces f° r pl ac ~ 

(h - 2 

ing the 1-bits of w among the 0-bits of v. Analogously, there are I _ 
choices for placing the 0-bits of w among the 1-bits of v in the second 
quarter. Analogously, in the third quarter, there arc ( ^ choices for plac- 
ing one 0-bit of w among the 0-bits of v, and for each of them, there are 
'5-2 s 



1 



choices for placing the second 0-bit of w among the 1-bits of v. 



Finally, there are ( fj choices (just one!) for placing the 1-bits of w among 
the 1-bits of v. 

In conclusion, the set of 2-vectors meeting the distribution of total coin- 
cidences (1, 1, 3, 5) is generated by the juxtaposition of any of the rows of 
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the following matrices 



1 1\ / 1 1 1 
01001 x 01011 I / 
01010/ \00111 



/l 1 1 0\ 
10 10 1 
111 
110 10 
110 1 

\0 1 1 I J 



x (0 1 1 0) 



Similar schemes are achieved from the remaining orderings of the distribu- 
tion (1, 1, 3, 5). 

Analogously, one may conclude that the set of 2— vectors meeting the distri- 
bution of total coincidences (1, 3, 3, 3) (and in a similar way its reorderings 
as well) is generated by the juxtaposition of any of the rows of the following 
matrices 



1 1 
1 1 | * 
10 10 



(1 1 1\ 
10 10 1 
110 1 
110 10 
10 110 

\0 1 1 1 0/ 



(I 1 1 0\ 
10 10 1 
111 
110 10 
110 1 

\0 1 1 1/ 



/0 1 1\ 
110 
10 10 
1 1 
10 10 

\1 1 0/ 



□ 



We have just described a procedure for determining a set S v of generators 
for the adjacency list related to a fixed A:- vector v, < k < [|J- In spite of 
the fact that the size (both in edges and vertices) of G t grows exponentially 
in t, this is a cheaper way (in terms of both time and space) of saving this 
information. 

Once we know how the adjacency relation in G t looks like, we are in con- 
ditions to describe an algorithm looking for cliques in G t . 



3 Local exhaustive search 

From the results described in the precedent Section, one may straightforwardly 
design an algorithm searching for a maximal (probably not maximum) clique 
in G t - Starting from a clique C initially consisting of a random vertex v in 
Gt, it suffices to repeatedly add a new vertex w to C, randomly chosen among 
those vertices simultaneously orthogonal to the vertices already in C. 

This is somehow a local exhaustive search. Exhaustive, in the sense that 
repeatedly the full set of vertices simultaneously orthogonal to the nodes of a 
given clique is constructed. Local, in the sense that just one vertex among the 
full set of candidates is actually used. 

We next include a pseudo-code for this algorithm. 
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Algorithm 1 Searching for cliques in Gt- 

Input : an integer t 

Output : a maximal clique in Gt 

Select a fc-vertex v in Gt 

clique 4— {v} 
adj <— S v 

while adj is not empty{ 

v <— a random s- vert ex in adj 

clique <— clique U {v} 
adj <— adj n i5 v 

} 

clique 

The complexity of the algorithm relays on the operations 6 V and adj D 5 V • 
From Proposition[3l S v may be constructed in polynomial-time. Unfortunately, 
the size of adj increases exponentially, and so the algorithm is both time and 
space consuming. 

Improved versions of the algorithm might be implemented, depending on 
a deeper knowledge of how orthogonality on fc-vertices works. 
We now include some execution tables. 

All the calculations of this section have been worked out in Mathematica 
4.0, running on a Pentium IV 2.400 Mhz DIMM DDR266 512 MB. 

The table below shows, for every 2 < t < 10, the number of essays which 
have been executed looking for cliques in Gt, the average time required in 
these calculations, the average size of these cliques, the largest size found so 
far, the time required in this calculation and one instance among the largest 
cliques already found. 



Table 2 Results obtained from Algorithm 1. 



t 


Essays 


Av.Time 


Av.Sizc 


Lar.S. 


Time 


Clique 


2 


10 


0.0232" 


5 


5 


0.016" 


166,101,106,169,60 


3 


10 


0.039" 


9 


9 


0.031" 


2396, 730, 881, 940, 1386, 2482, 1433, 
2281, 1268 


1 


10 


0.368" 


13 


13 


0.327" 


22166, 22874, 11698, 34776, 13251, 
19428, 49971, 13116, 39594, 43606, 
38246, 7793, 26218 


5 


10 


0.369" 


11 


17 


0.468" 


615882, 81761, 124596, 682665, 
315178, 323811, 183761, 808837, 
405190, 572306, 350936, 677490, 
193356, 813428, 219558, 406968, 
564460 
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t 


Essays 


Av.Timc 


Av.Sizc 


Lar.S. 


Time 


Clique 














1943217, 2913196, 4896610, 1866067, 














13689735, 5584078, 10118748, 4943241, 


6 


10 


4.128" 


13.8 


21 


3.198" 


3324617, 6759253, 5685752, 8874722, 


14710001, 8830356, 6614554, 4840148, 
8771401, 3272036, 3364754, 13712184, 
11117018 














30121113, 153038560, 15673110, 














73300362,41266505, 117929485, 


7 


10 


12.19" 


12.8 


17 


20.53" 


86698833, 43440080, 105500358, 


176876844 41595557 156361628 
172841777, 119125298, 205299228, 
79251369, 210612677 














1305024466, 1295442290, 2999574898, 














3795589971, 3631000788, 3248355466, 


8 


4 


1'39" 


13.75 


15 


59.37" 


364489541, 2896517604, 2629324346, 
1255832164, 504314142, 3570677031, 
965647459, 1387503024, 1483592489 














56453847575 4307443^680 














41936051018 22138975396 














23156633420 41987793978 


9 




13'46" 


14.25 


16 


6'34" 


37225385892, 18981806746, 


4 


45452914326, 7371644105, 
53995320335, 52910835425, 
60376161688, 19823315036, 
10578923147, 28478917930 














611551599738, 585720653408, 














381375537029, 727566346115, 














90629862411, 851419790210, 


10 




4 h 50'6" 


16 


16 


3 h 0'50" 


418284730924, 524680378162, 


2 


364580931042, 154503136551, 
747357767564, 199828304181, 
99982406086, 319408643342, 
422206490833, 34030595756 



As the table below shows, it is remarkable that the sizes of the largest 
cliques found so far are greater than [yj — 3 (a third of a full Hadamard 
matrix of size At) and even 2£ — 3 (a half of a full Hadamard matrix of size 
4i), which are the best asymptotic bounds on the depth of partial Hadamard 
matrices known so far. 



Table 3 Comparing sizes of cliques. 



t 


2 


3 


4 


5 


6 


7 


8 


9 


10 


LfJ-3 


-1 


1 


2 


3 


5 


6 


7 


9 


10 


2t - 3 


1 


3 


5 


7 


9 


11 


13 


15 


17 


Lar.S. 


5 


9 


13 


17 


21 


17 


15 


16 


16 



Although our procedure gives large partial Hadamard matrices (of depth 
greater than half of a full Hadamard matrix) , it is very expensive both in space 
and time. 
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It would be desirable to find a way to design a procedure running signifi- 
cantly faster and which nevertheless leads to large partial Hadamard matrices 
as well. We describe such an algorithm in the following section. 



4 Heuristic searches 

In this section we describe two heuristics for searching for cliques in G t . 

The first of them is a straightforward adaptation of the best (as far as 
we know) genetic algorithm for solving the maximum clique problem (MCP). 
Since it is very expensive in time, we then describe a second heuristic, which 
is much faster, in exchange of precision (in the sense that not sufficiently large 
cliques are obtained). 

Nevertheless, it is a remarkable fact that this last procedure admits as 
input data a clique already constructed. Consequently, one could eventually 
obtain a larger clique. Experimental results suggest that this actually happens 
more times than one could initially think (see Table |4~2")) . In fact, it seems to 
be a good idea to combine this fast search with other more precise procedures. 



4.1 Classical GAs for MCP 

As we commented before, finding the maximum clique of a graph is a NP- 
Hard problem, and consequently all known exact algorithms for this problem 
will run in time that grows exponentially with the number of vertices in the 
graph. This makes these algorithms infeasible even in case of moderately large 
problem instances. Therefore most of the efforts to solve the maximum clique 
problem are based on heuristic approaches. 

In [21] a heuristic based steady-state genetic algorithm for the maximum 
clique problem is described. The steady-state genetic algorithm generates 
cliques, which are then extended into maximal cliques by the heuristic. Af- 
ter comparison with the three best evolutionary approaches for the maximum 
clique problem, they find out that their algorithm outperforms all the three 
evolutionary approaches in terms of best and average clique sizes found on 
majority of DIMACS benchmark instances (which are the canonical family of 
graphs used to test MCP-algorithms) . 

The main features of the genetic algorithm in |21| consist in: 

— Chromosome representation. A n-lcngth bit vector represent a chromosome 
(i.e. a clique), so that a value of 1 at the ith position indicates that the 
vertex i is in the clique. 

— Crossover. They use fitness based crossover, so that a child is constructed 
bit by bit, receiving each time the bit from one of its parents, with prob- 
ability proportional to the fitness of the parents. The vector obtained so 
far may not be a clique, so a repairing function is needed to transform the 
child into a clique. 
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— Repair. First, all 1 bits corresponding to vertices with significantly low 
degree (in comparison with the provisional fittest individual) are changed 
to 0. Then, the repairing procedure introduced by Marchiori in |20j is used, 
so that repeatedly a vertex of the child is selected at random, and either 
it or all the vertices not adjacent to it are deleted (i.e. the corresponding 
bits are fixed to 0), until the child becomes a clique. 

— Mutation. Mutation consists in simple bit flip mutation, where each bit in 
the chromosome is flipped with a pre- fixed probability p m . Once again, the 
repairing function is needed to guarantee that a valid chromosome (i.e. a 
clique) is obtained. 

— Extension. Once a valid chromosome (i.e. a clique) is constructed, an ex- 
tension function is applied, in order to extend the given clique to a maximal 
clique. The idea is repeatedly adding a vertex with highest degree among 
the set S of vertices simultaneously adjacent to the vertices already in the 
clique. When the size of the set S is small enough, then the exhaustive 
search of Carraghan and Pardalos [7] is performed. 

— Fitness. The fitness of a chromosome is equal to the size of the clique that 
it represents. 

— Selection. They use binary tournament selection, where the candidate with 
better fitness is selected with a pre-fixed probability p\>. 

— Replacement policy. No duplicate chromosomes are permitted in the pop- 
ulation. When a new child is constructed, it always replaces the worst 
member of the population, irrespective of its own fitness. 

It seems natural trying to adapt this algorithm to our case. Unfortunately, 
one cannot afford to explicitly construct the adjacency lists of Gt, for values 
t > 5, since they grow exponentially in t, as we showed in the section before. 

So the extension function described in |21j cannot be applied in the case 
of the graph Gt- Nevertheless, we can use instead the algorithm proposed in 
the previous section. 

Algorithm 2 GA looking for cliques in Gt- 

Substitute the extension function in \21}/ by Algorithm 1. 

We show now some executions for 2 < t < 9, where the size of the popula- 
tion is fixed to 5 and the maximum number of generations is fixed to 20. 

Table 4 Results obtained from Algorithm 2. 



t 


Time 


# generations 


#{C : \C\ is maximal} 


\c\ 


PH size 


2 


0, 171" 





5 


5 


8 


3 


0, 359" 





5 


9 


12 


4 


1,872" 





5 


13 


16 


5 


3,931" 





3 


17 


20 


6 


19,36" 





3 


21 


24 


7 


4' 10" 


20 


3 


17 


20 


8 


24'37" 


20 


1 


21 


24 


9 


4 h 36" 


20 


1 


18 


21 
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We next include explicitly the cliques found so far. For brevity, we give 
the decimal number representations of the binary 4i-vectors which form each 
clique. 

Table 5 Cliques obtained from Algorithm 2 listed explicitly. 



t 


Clique 


2 


89, 169, 195, 149, 101 


3 


2396, 922, 3347, 3214, 2635, 2290, 2854, 2473, 1386 


4 


11633, 27850, 20148, 40089, 29719, 6098, 14940, 43414, 22981, 40038, 15011, 42693, 13740 


5 


355028, 694485, 626266, 436558, 57188, 250510, 647277, 308937, 189682, 
809194, 832054, 113080, 211555, 381267, 603590, 684888, 584579 


6 


9162129, 15599647, 12071353, 7947574, 2834188, 10323501, 6722981, 7449237, 
5936794, 3845219, 2986835, 13186247, 14739211, 5822797, 5711193, 13280116, 
14785656, 11687509, 9747734, 11764174, 13835043 


7 


206229172, 136297864, 145421137, 239938137, 244421436, 78464909, 111023826, 
96903644, 49849529, 150232156, 47868686, 213067927, 46619057, 237397306, 
193760535, 174494351, 223891867 


8 


925672665, 545242888, 131329093, 1388111142, 3800772080, 2494264851, 2084807742, 
3004829231, 432983728, 1265186469, 1437667228, 2976213301, 766141292, 455842134, 
3399920157, 3380112474, 2022530125, 1674167067, 3521287529, 517486905, 1394045560 


9 


6263028814, 17380851458, 66723786135, 13185706648, 61523714894, 11518454374, 
28777615826, 31813316858, 7484361626, 45216625720, 10357786275, 32845624107, 
64532207782, 48511720424, 46756144798, 24818822896, 53243861654, 27484036889 



A comparison with the results obtained in the section before, reveals that 
the genetic algorithm is not as useful as desired. 

On one hand, the only improvements are obtained for the values t = 8,9, 
and they are not significantly impressive. Moreover, it seems that quality (in 
terms of the size of the cliques obtained) relays on the extension function 
rather than in the genetic procedure itself. 

On the other hand, each run of the extension function is very time-consu- 
ming, so the required time for executing a full run of the genetic algorithm 
grows drastically. And there is no a dramatic increase in the size of the obtained 
cliques in return. 

It would be desirable to look for a faster way of extending cliques, attending 
to the particular properties of our graph Gt- We tackle with this question in 
the next section. 



4.2 A fast heuristic search 

In order to get a faster heuristic search, we need to have a deeper look at the 
adjacency relations between the vertices in Gt- 

We first translate Proposition [T] (which characterizes the set of s- vectors 
orthogonal to a given fc-vector in terms of distributions of total coincidences 
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(0% , a.i 2 , a; 3 , cxi A )), in terms of distributions 12,13, h) of coincidences in Is 
(for the first and last quarters) and Os (for the second and third quarters). 

Proposition 5 The set of s-vectors orthogonal to a given k-vector corre- 
sponds to the full set of distributions of s-vectors satisfying tuples of coin- 
cidences in Is (first and fourth quarters) and Os (second and third quarters) 
{ili iii *3> ^4) such that i\ + i 2 + *3 + *4 = 2s + 2k — t. Furthermore, this is 
possible iff 2s + 2k — t > 0. 

Proof From Proposition [TJ we know that the set of s-vectors orthogonal to 
a given fc-vector is characterized by those distributions of total coincidences 
(a i± , a i2 , a i3 , a l4 ) such that = 2t 

Since a,=i — s — fc + 2i from Lemma the relation above comes to be 

At - 4k - 4s + 2zi + 2i 2 + 2i 3 + 2i 4 = 2t<^i 1 +i 2 + h + 14 = 2s + 2k-t. 

Now, on one hand, since < ij < min(A;, s), the value ii + i 2 + i 3 + «4 runs 
over the range < i± + i 2 + i 3 + i 4 < 4min(fc, s). 

On the other hand, since < s,fc < |_f_|, it is clear that 2s + 2k — t < 
2 min(&;, s). 

Thus, provided 2s + 2k — t > 0, this value is in the range valid for i\ + 
?2 + ?3 + 24 j and therefore there exists a distribution of coincidences in Is (at 
first and fourth quarters, i\ and i 4 respectively) and Os (at second and third 
quarters, i 2 and i 3 respectively), such that i\ + i 2 + i 3 + i 4 = 2s + 2k — t. 

m 

Now it is apparent that not all possible values s in the range < s < |_|J 
do provide s-vectors orthogonal to a given k- vector. 

Proposition 6 Fixed a k-vector v ; there exist s-vectors orthogonal to v iff 

se [[|l-fc,L|J]. 

Proof The upper bound is given in Lemma [3J 

On the other hand, from Proposition [SJ there exist s-vectors orthogonal 
to a given fc-vector iff < 2s + 2k — t. Consequently, s > | — fe, that is, 



Furthermore, we may straightforwardly precise the number of s-vectors 
orthogonal to a given k- vector, for some fixed s e [|~|] — k, |_|J]. 

Lemma 6 Fixed a valid distribution i 2 , 23, 14), the number of s-vectors 
orthogonal to a given k-vector is given by the expression: 



/k\ (t-k \ (k \ (t-k \ (k\ (t-k \ (k\ (t-k \ 
\i\j \s-i\) \i 2 ) \s -i 2 J \i 3 J \s - i 3 J \i 4 J \s - i 4 J 
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Table 6 Distribution of s-vectors orthogonal to a fc-vector. 





t = 3 




k = 


fc = 1 


s = 








s = 1 





32 





t = 4 


t = 5 




fc = 


fc = 1 


fc = 2 


fc = 


fc = 1 


fc = 2 


s = 








1 











s = 1 





81 


96 








216 


s = 2 


1296 


486 


454 





3456 


3240 





t = 6 




t 


= 7 






fc = 


fc = 1 


fc = 2 


fc = 3 


fc = 


fc = 1 


fc = 2 


fc = 3 


s = 











1 














8=1 








256 


486 











768 


s = 2 





10000 


14688 


15795 








40000 


57024 


s = 3 


160000 


60000 


49920 


47148 





480000 


440000 


422208 





t = 8 




fc = 


fc = 1 


fc = 2 


fc = 3 


fc = 4 


s = 














1 


8=1 











625 


1536 


s = 2 








50625 


147000 


183904 


s = 3 





1500625 


2352000 


2601000 


2655744 


8 = 4 


24010000 


9003750 


7183750 


6483750 


6297030 





t = 9 




fc = 


fc = 1 


fc = 2 


fc = 3 


fc = 4 


s = 

















s = 1 














2000 


s = 2 











243000 


464000 


s = 3 








7203000 


11210000 


12912000 


s = 4 





76832000 


69629000 


65367000 


63430000 









t = 


10 








fc = 


fc = 1 


fc = 2 


fc = 3 


fc = 4 


fc = 5 


s = 

















1 


8=1 














1296 


3750 


8 = 2 











194481 


858600 


1200625 


s = 3 








9834496 


32773650 


48326400 


53560000 


s = 4 





252047376 


407209600 


453248775 


468312600 


472003750 


s = 5 


4032758016 


1512284256 


1180754176 


1041640236 


978746976 


960098756 



The following tables show the number of s-vectors orthogonal to a given 
fc-vector, for 3 < t < 10, < fc < |_f J , and < s < |_|J . 

In particular, these results suggest that large cliques in Gt should consist 
of fc- vectors, for large values of fc, close to |_|J • 

This seems to be so, as the calculations below suggest. 

For each 3 < t < 9, we choose at random a Hadamard matrix of or- 
der At from Sloane's online library (52], say had. 12, hadl6.4, had20 .hall .n, 
had24.pal, had28.pal2, had32.pal, had36.pal2. 

We now normalize these matrices, by means of the following algorithm. 
Notice that since just negation and permutation of columns are used, the 
Hadamard character of the matrix is preserved. 
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Algorithm 3 Hadamard normalization 

— Negate those columns consisting of a first negative entry. 

— Now, locate those columns i, 1 < i < 2t, consisting of a second negative 
entry. Locate those columns j, 2t + 1 < j < At, consisting of a second 
positive entry. Interchange them. 

— Proceed as the step before, now by quarters. As a result, you will obtain a 
normalized Hadamard matrix. 

Now we randomly fix a |_f J -vector among the rows of these matrices (for 
instance, the first such occurrence). The table below shows the distribution 
of the values s, for the s-vectors of the remaining rows. In addition, we also 
include the distribution of (not ordered!) total coincidences (a il , . . . , a i4 ). 

Table 7 Rows in Hadamard matrices are fc-rows, for k G {|_f J — b Lf J}- 



t 


row 


*= UJ 


* = Lil-i 


#{total coincidences} 


3 


4 


8 





(1, 1,1,3) -y 8 










(2,2,2,2) ->4 


4 


6 


8 


4 


(0, 2,2,4) ->4 
(1, 1,3,3) -> 4 


5 


5 


15 


1 


(1.3.3.3) ->■ 12 
(1, 1,3,5) -> 3 

(2.2.2.4) -> 1 


6 


4 


11 


9 


(2, 2,4,4) 9 
(2,2,2,6) 1 
(0,4,4,4) 1 
(1,3,3,5) -> 6 
(3, 3, 3, 3) -> 3 










(3,3,3,5) ->■ 16 
(1,3,5,5) -> 3 


7 


5 


21 


3 


(1,3,3,7) -> 2 
(2,2,4,6) 2 
(2,4,4,4) 1 


8 


4 


12 


16 


(2.4.4.6) -»■ 12 
(3,3,5,5) -»■ 12 
(1,5,5,5) -> 2 

(3.3.3.7) -> 2 


9 


4 


26 


6 


(3.5.5.5) -»■ 20 
(1,5,5,7) -> 6 

(2.4.6.6) ->■ 6 



The table above suggests that one should focus on fc-vectors for k £ { [§J — 
1, |_f J}- Furthermore, the vector of total coincidences per quarter uses to be 
homogeneously distributed. 

With these ideas at hand, we next design a fast heuristic searching for (or 
eventually extending) cliques in G t . 
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Algorithm 4 Fast heuristic for extending cliques in Gt. 

Input: a clique C in G t 

Output: a clique C in Gt containing C 

C' = C 

for k from [f J to [_|J — 1 step —1 do 
iter = 

while iter < t do 

{bool,v} =buildgrapas(C", k) 

If boot then iter = 0; C — C' U {v} else iter = iter + 1 

od 

od 

C' 

The function buildgrapas tries to construct an s-vector quarter by quarter 
(in a random ordering), attending to the following aspects: 

— Select a number of total coincidences for the quarter, according to the 
range valid at this step (i.e. such that equation (P) can be satisfied), and 
with probability proportional to the number of its appearances in the set 
of solutions described in Corollary [2] 

— Once the desired number of total coincidences has been fixed, a genetic 
procedure is performed, for constructing a valid quarter (i.e. such that ([T]) 
can be satisfied). This heuristic consists of populations of At individuals. 
If no valid quarter is found after At generations, the search ends with a 
failure. In this case, the last quarter constructed so far is deleted, and the 
process goes on from this point. This situation is limited to occur at most 
t times. 

— This search is performed at most 10 times. If no valid vector is constructed 
after these 10 attempts, the search stops and a False boolean is returned. 

The table below shows, for every 2 < t < 10, the number of essays which 
have been executed looking for cliques in Gt, the average time required in 
these calculations, the average size of these cliques, the largest size found so 
far, the time required in this calculation and one instance among the largest 
cliques already found. 

Table 8 Results obtained from Algorithm 4. 



t 


Essays 


Av.Time 


Av.Sizc 


Lar.S. 


Time 


Clique 


2 


10 


1.4" 


5 


5 


1.4" 


86,101,149,89,60 


3 


10 


0.565" 


9 


9 


0.5" 


1452, 2396, 1393, 874, 756, 2482, 921, 
1242, 2281 


1 


10 


19.603" 


11 


12 


17.28" 


25542, 15462, 13769, 39626, 50538, 
22099, 38294, 22188, 22876, 52421, 
22947, 27802 
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t 


Essays 


Av.Timc 


Av.Size 


Lar.S. 


Time 


Clique 














193425, 586090, 808611, 420073 














350674, 408358, 222834, 109987, 


5 


10 


27.369" 


9 


17 


76.924" 


315192, 341833, 604856, 662897, 
171722, 308468, 121445, 218540, 
552900 














13215089, 13015273, 10053835, 12875531, 


6 


10 


51.38" 


7.6 


10 


56.87" 


12954262, 6768979, 3454163, 9730420, 




3324617, 6759253, 5685752, 8874722, 
999010,906181 














159860692, 161016643, 73886147, 


7 


10 


1'23" 


8 


9 


1'46" 


142481171, 173143250, 170760901, 
21953176, 43986594, 203259148 














866473426, 1497876124, 1273533381, 


8 


10 


2' 11" 


7.6 


9 


2'53" 


1697995341, 1439971764, 3784746441, 
2345981979, 2309832360, 242628440 














56095030680, 38163367817, 














41103334725, 54854789721, 


9 


10 


3'02" 


7.8 


9 


3'05" 


22744355553, 45279148512, 
4785499937, 52133944916, 
60209197830 



Although the size of the cliques obtained so far are smaller than those 
constructed by the precedent procedures, it is a remarkable fact that this 
algorithm is substantially faster. In fact, this procedure should be considered as 
an extension function for cliques better than a procedure itself for constructing 
cliques starting from the empty graph. 

This idea is supported by the calculations showed in the table below, where 
cliques C of size \C\ <2t in G t constructed by the procedures described in [5J 
[TP] (after normalization by Algorithm [3]) are extended to larger cliques C (of 
size \C'\ > 2t, more than a half of a full Hadamard matrix!) with Algorithm 
® 

Table 9 Algorithm 4 applied to PH mX £t in |10l produces f-ff( m + n ) X 4 4 , with m + n > 2t. 



t 


\c\ 


\C\ 


time 


added vertices 


1 


5 


12 


17.97" 


21930, 50745, 25500, 13107, 
37740,42330,51510 


5 


5 


7 


19.38" 


118659,341714 


6 


9 


15 


1'07" 


3099915, 9123660, 8841105, 
10606050, 4844385, 4692870 


7 


9 


9 


1'03" 




8 


13 


17 


2? 51" 


1923517785, 3032697660, 
1695521520, 2956808130 



Notice that for t = 7, the input clique has not been extended to a larger 
one. We suspect that the input clique is maximal, and therefore a larger clique 
containing it could not exist. 
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5 Conclusions 

In this paper we have described three algorithms looking for pretty large partial 
Hadamard matrices (i.e. about a third of a full Hadamard matrix), in terms 
of cliques of the Hadamard Graph Gt- 

The hrst one (Algorithm 1) performed some kind of local exhaustive search, 
and consequently is expensive from the time consuming point of view. So we 
decided to design some heuristic for constructing partial Hadamard matrices. 

Our first approach (Algorithm 2) consisted in an adaptation of Singh and 
Gupta's genetic algorithm for the Maximum Clique Problem. Unfortunately, 
it did not work properly, since it used Algorithm 1 for extending cliques, and 
consequently was very expensive in time as well. 

Algorithm 4 prioritizes the required processing time better than the final 
size of the partial Hadamard matrix to be obtained. Experimental results show 
that this algorithm may output pretty large partial Hadamard matrices (larger 
than half a full Hadamard matrix!), provided a suitable initial clique is given 
as input data. 

All the algorithms that we have presented here are based on the properties 
of the Hadamard Graph Gt which have been described in Section 2. 

It would be an interesting question whether different techniques and meth- 
ods could be considered for designing alternative algorithms searching for large 
partial Hadamard matrices. For instance, one could ask about the techniques 
and methods which have been shown to be useful when manipulating full 
Hadamard matrices. 

Unfortunately, this will not be the case, in general. For instance, consider 
the case of the cocyclic approach. 

More concretely, the cocyclic framework has arised as a promising way to 
construct (cocyclic) Hadamard matrices [T4iriMll4] • One could ask whether the 
cocyclic framework is also a good place to look for partial Hadamard matrices. 
Actually, this is not the case. 

Proposition 7 The depth of any partial Hadamard matrix which is a subma- 
trix of a cocyclic matrix Mf is at most a half of the size of Mf. 

Proof Attending to the proof of the cocyclic Hadamard test in [TB] (see Lemma 
1.4 on p. 281), fixed a multiplicative group G — {gi = 1, . . . , g n } and a cocyclic 
matrix Mf = {f(gi,gj)) over G, rows gi ^ gk in Mf are orthogonal if and 
only if the summation of the row gig^ 1 ^ gi) is zero (and consequently, the 
summation of the row gkg^ 1 as well). 

If a row gk ^ g\ in Mf fails to sum zero, then, for each 1 < i < n, the pair 
of rows {gi,gk~ 1 gi} (and also {gi,gkgi}) fails to be orthogonal. By partitioning 
the rows of Mf into pairs of the type {gt, g^gi}, it turns out that such a pair 
contributes at most one row to a partial Hadamard matrix included in Mf. 
Consequently, the depth of any partial Hadamard matrix which is a submatrix 
of Mf is at most as claimed. 
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It would be interesting to think about the way in which /c-vertices in Gt 
could be combined in order to get larger cliques. 

Nevertheless, it would be also interesting to investigate whether improved 
versions of the algorithms described in this paper could be designed, attending 
to other considerations. 
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